Efficient Anatomical Labeling by Statistical Recombination of Partially Label Datasets
نویسندگان
چکیده
Introduction: Numerous clinically relevant conditions (e.g., degeneration, inflammation, vascular pathology, tramatic injury, cancer, etc.) correlate with volumetric/morphometric changes as observed on MRI. Study of these changes necessitates the ability to label (delineate) voxel-wise classifications for structures of interest. The established gold standard for identifying class memberships is manual voxel-by-voxel labeling by a neuro-anatomist expert, which can be exceptionally time and resource intensive. Furthermore, different human experts often have differing interpretations of ambiguous voxels (on the order of 5-10% of a typical brain structure). Therefore, pursuit of manual approaches is typically limited to either (1) validating automated or semi-automated methods or (2) the study of structures for which no automated method exists. Previously, statistical methods have been presented to simultaneously estimate rater reliability and true labels from complete datasets by several different raters [1-3]. These maximum likelihood/maximum a posteriori methods (i.e., STAPLE) increase the accuracy of a single labeling by combining information from multiple, potentially less accurate raters (so long as the set of raters is independent and unbiased). However, the existing methods require that all raters delineate all voxels, which limits applicability in real research studies where different sets of raters may delineate arbitrary subsets of a population of scans due to the rater availability or the duration of the study. Here, we present Simultaneous Truth and Performance Level Estimation with Robust extensions (STAPLER) to enable use of data with: (1) Missing labels: partial labels sets in which raters do not delineate all voxels; (2) Repeated labels: labels sets in which raters may generate repeated (independent) labels for some (or all) voxels; and (3) Catch trials: label sets in which some raters may have known reliabilities (or some voxels have known true labels). STAPLER simultaneously incorporates all labels from all raters to estimate a maximum a posteriori estimate of both rater reliability and labels.
منابع مشابه
Understanding and Using Patterns of Food Labeling Systems and their Determinants by Medical Students of Tabriz University of Medical Sciences, Iran
Background and Objectives: Increased public knowledge concerning roles of nutrition in prevention of non-communicable diseases have urged people to select healthy foods. The aim of this study was to investigate levels of understanding and use of food labeling systems and their determinants by medical students of Tabriz University of Medical Sciences, Tabriz, Iran. Materials and Methods: In a c...
متن کاملUsing Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise
The growing importance of massive datasets with the advent of deep learning makes robustness to label noise a critical property for classifiers to have. Sources of label noise include automatic labeling for large datasets, non-expert labeling, and label corruption by data poisoning adversaries. In the latter case, corruptions may be arbitrarily bad, even so bad that a classifier predicts the wr...
متن کاملRobust Patch-Based Multi-Atlas Labeling by Joint Sparsity Regularization
Automated labeling of anatomical structures on MR brain images is widely investigated in many neuroscience and clinic studies to quantitatively measure either individual or group structural/functional difference. To address the issue of registration errors that affect multi-atlas based labeling, patch-based label fusion methods have been recently proposed by inspecting the similarity of each pa...
متن کاملiVIBRATE: Interactive Visualization Based Framework for Clustering Large Datasets (Version 3)
With continued advances in communication network technology and sensing technology, there is an astounding growth in the amount of data produced and made available through the cyberspace. Efficient and high-quality clustering of large datasets continues to be one of the most important problems in largescale data analysis. A commonly used methodology for cluster analysis on large datasets is the...
متن کاملA New Method for Solving Constraint Satisfaction Problems
Many important problems in Artificial Intelligence can be defined as Constraint Satisfaction Problems (CSP). These types of problems are defined by a limited set of variables, each having a limited domain and a number of Constraints on the values of those variables (these problems are also called Consistent Labeling Problems (CLP), in which “Labeling" means assigning a value to a variable.) Sol...
متن کامل